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TRACKING DOWN THE SOURCE POPULATION RESPONSIBLE FOR THE UNRESOLVED COSMIC 

6-8 KEV BACKGROUND 

Y. Q. Xue 1 ' 2 ' 3 , S. X. Wang 1 ' 2 , W. N. Brandt 1 ' 2 , B. Luo 1 ' 2 , D. M. Alexander 4 , F. E. Bauer 5 ' 6 , A. Comastri 7 , 
A. C. Fabian 8 , R. Gilli 7 , B. D. Lehmer 9 ' 10 , D. P. Schneider 1 ' 2 , C. Vignali 11 , and M. Young 1 ' 2 

ABSTRACT 

Using the 4 Ms Chandra Deep Field-South (CDF-S) survey, we have identified a sample of 6845 
X-ray undetected galaxies that dominates the unresolved sa 20-25% of the 6-8 keV cosmic X-ray 
background (XRB). This sample was constructed by applying mass and color cuts to sources from 
a parent catalog based on GOODS-South HST z-band imaging of the central 6'-radius area of the 
4 Ms CDF-S. The stacked 6-8 keV detection is significant at the 3.9a level, but the stacked emission 
was not detected in the 4-6 keV band which indicates the existence of an underlying population of 
highly obscured active galactic nuclei (AGNs). Further examinations of these 6845 galaxies indicate 
that the galaxies on the top of the blue cloud and with redshifts of 1 < z < 3, magnitudes of 
25 < Z850 < 28, and stellar masses of 2 x 10 8 < M*/M© < 2 x 10 9 make the majority contributions 
to the unresolved 6-8 keV XRB. Such a population is seemingly surprising given that the majority 
of the X-ray detected AGNs reside in massive (> 10 10 M Q ) galaxies. We discuss constraints upon 
this underlying AGN population, supporting evidence for relatively low-mass galaxies hosting highly 
obscured AGNs, and prospects for further boosting the stacked signal. 

Subject headings: surveys — galaxies: active — diffuse radiation — X-rays: diffuse background — 
X-rays: galaxies 



1. INTRODUCTION 

Deep extragalactic X-ray surveys have been effective in 
finding active galactic nuclei (AGNs) out to z sa 5 (see, 
e.g., Brandt & Hasinger 2005 and Brandt & Alexander 
2010 for reviews), with an observed AGN sky density 
approaching 14,900 deg~ 2 achieved by the deepest X-ray 
surveys, the Chandra Deep Fields (e.g., Brandt et al. 
2001; Giacconi et al. 2002; Alexander et al. 2003; Bauer 
et al. 2004; Luo et al. 2008; Xue et al. 2011, here- 
after Xll; Lehmer et al. 2012). In both broad (0.5-2 and 
2-8 keV) and narrow (0.5-1, 1-2, 2-4, 4-6, and 6-8 keV) 
energy bands, ~ 75-95% of the cosmic X-ray background 
(XRB) emission has been resolved into discrete sources 
(e.g., Brandt et al. 2000; Mushotzky et al. 2000; Bauer 
et al. 2004; Worsley et al. 2005; Hickox & Markevitch 
2006; Luo et al. 2011), the majority of which are moder- 
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ately to highly obscured AGNs (e.g., Barger et al. 2003; 
Szokoly et al. 2004; Tozzi et al. 2006). Of particular 
interest is the remaining unresolved XRB at the high- 
est energies accessible to Chandra, 6-8 keV, since XRB 
synthesis models (e.g., Gilli et al. 2007) indicate that 
much of this emission should arise from the highly ob- 
scured AGNs that contribute strongly to the XRB near 
its « 20-40 keV peak. 

Recently, utilizing the 4 Ms Chandra Deep Field-South 
(CDF-S; Xll) data, Luo et al. (2011) found that the 
resolved 6-8 keV XRB fraction is 78% ± 6%, tak- 
ing into account both the X-ray source contribution and 
bright-end correction, and adopting the XRB normaliza- 
tion from Hickox & Markevitch (2006; the XRB has a 
power-law spectral slope of T = 1.4 and a normalization 
of 10.9 photons s" 1 keV -1 sr" 1 at 1 keV)E3 Luo et al. 
(2011) further found that the unresolved « 20-25% of 
the XRB in the 6-8 keV band can plausibly be explained 
by the stacked emission (a 2.5<r signal) from a sample 
of 18,272 X-ray undetected GOODS-South (GOODS-S) 
HST z-b&nd sources. The above resolved fraction should 
be considered cautiously as it is known that there is cos- 
mic variance, likely arising from the underlying large- 
scale structure variations between fields, in the deepest 
Chandra surveys (e.g., Barger et al. 2002, 2003; Cowie 
et al. 2002; Gilli et al. 2003, 2005; Yang et al. 2003; 
Hickox & Markevitch 2006; Silverman et al. 2010). The 
resultant uncertainty in the XRB normalization is likely 
in the range 10-20% (e.g., Hickox & Markevitch 2006). 
In this field Luo et al. (2011) obtained the above 6-8 keV 
stacked signal and it is of interest to understand its ori- 
gin. Luo et al. (2011) also showed that there should be 

12 Throughout this paper, the resolved 6-8 keV XRB fraction 
refers to the ratio between the detected 6—8 keV surface brightness 
in the field and the 6—8 keV XRB level determined by Hickox & 
Markevitch (2006). 
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an underlying population of highly obscured AGNs hid- 
den among the aforementioned X-ray undetected galax- 
ies. Thanks to superb sensitivities, ultradeep X-ray ob- 
servations have already been able to reveal a significant 
fraction of such previously "missing" highly obscured 
AGNs (e.g., Alexander et al. 2011; Comastri et al. 2011; 
Feruglio et al. 2011; Gilli et al. 2011; Xll) that are esti- 
mated to be roughly as numerous as moderately obscured 
AGNs (e.g., Gilli et al. 2007). 

Mounting evidence has shown that luminous AGNs 
tend to reside in massive (i.e., M* > 10 10 M©; M* is 
stellar mass) and red galaxies from the local universe up 
toz« 3-4 (e.g., Barger et al. 2003; Bundy et al. 2008; 
Brusa et al. 2009; Silverman et al. 2009; Xue et al. 2010, 
hereafter X10; Mullaney et al. 2012). In this paper, we 
thus focus on using these mass and color constraints as 
clues to hunt for an underlying population of highly ob- 
scured AGNs responsible for the unresolved « 20-25% of 
the 6-8 keV XRB. This paper is structured as follows: in 
§[21 we describe how we estimated physical properties for 
sources of interest; in § [3] we present the source-stacking 
analysis and the results obtained; in § [4] we assess the 
robustness of the stacking results; and in §[5] we discuss 
the implications of the results. 

Throughout, a cosmology of Hq = 70.4 km s _1 Mpc^ 1 , 
ft m = 0.272, and fl\ — 0.728 is adopted (e.g., Komatsu 
et al. 2011). Unless stated otherwise, apparent magni- 
tudes are given in the AB system (Oke & Gunn 1983), 
and rest-frame absolute magnitudes are given in the Vega 
system (Johnson & Morgan 1953). We adopt a Galactic 
column density of Ah = 8.8 x 10 19 cm~ 2 (e.g., Stark et 
al. 1992) along the line of sight to the CDF-S and cor- 
rect for Galactic extinction in all relevant X-ray analyses 
below. 

2. SOURCE PROPERTIES 

In this section we describe briefly how we estimated 
source properties, i.e., redshifts, rest-frame absolute 
magnitudes, and stellar masses, for sources of interest. 
The Luo et al. (2011) sample of 18,272 X-ray unde- 
tected GOODS-S HST z-band sources is located within 
6' of the 4 Ms CDF-S average aim point (aj2000 = 
03 h 32 m 28.06 s , <5j 2 000.0 = -27°48'26.4") and outside 
of twice the 90% encircled-energy (in the 0.5-2 keV 
band) aperture radius of any 4 Ms CDF-S main-catalog 
sourcf3 (the resultant total stacking area is 0.027 deg 2 ). 
As shown in Luo et al. (2011), this sample appears to be 
responsible for the unresolved ~ 20-25% of the 6-8 keV 
XRB. Discarding the 160 sources with a low signal-to- 
noise ratio that were not included in the Dahlen et al. 
(2010) catalog (detailed below) and the 77 stars that were 
spectroscopically identified therein (see § 2.3 of Dahlen 
et al. 2010 for the references of the spectroscopic data 
used), we reduce the size of the above Luo et al. (2011) 
sample to 18,035 and refer to this reduced sample as 

13 As described in Xll, the 4 Ms CDF-S main catalog contains 
740 X-ray sources that are detected with WAVDETECT at a false- 
positive probability threshold of 10~ 5 in at least one of three X-ray 
bands (0.5-8 keV, full band; 0.5-2 keV, soft band; and 2-8 keV, 
hard band) and also satisfy a binomial-probability source-selection 
criterion of P < 0.004 (i.e., the probability of sources not being 
real is less than 0.004). The flux limits at an off-axis angle of 
6' for the 4 Ms CDF-S are ss 1.2 x 10" 16 , 3.1 x 10" 17 , and 2.6 x 
10~ 16 erg cm -2 s — 1 for the full, soft, and hard bands, respectively. 



"Sample A" hereafter. The properties and contribution 
to the 6-8 keV XRB for Sample A are listed in Table [TJ 
which shows that Sample A still appears to be responsi- 
ble for the unresolved 6-8 keV signal seen by Luo et al. 
(2011; see § [3] for the details of the stacking procedure). 
We provide in Table [2] resolved XRB fractions in various 
bands between 0.5 and 8 keV for Sample A and additional 
samples of interest (see § [3]) . We also directly illustrate 
the values in Table [2] as Fig. [TJ 

It can be inferred from Fig. Q] (i.e., the top- most 
summed data points shown as squares) that there should 
be a yet-to-be-identified source population that accounts 
for the remaining < 6 keV emission without boosting 
significantly the 6-8 keV emission. As shown in § [3J 
this missing source population cannot be associated with 
individual galaxies, which would otherwise have been 
stacked already. It is likely that this remaining < 6 keV 
emission is from groups/clusters (e.g., Cappelluti et al. 
2012), whose emission would not be included in our 
stacking of galaxies and whose spectrum often has a 
strong thermal cutoff thus contributing emission in the 
soft band but not much in the hard band. It is also likely 
that cosmic variance might play some role here (e.g., 
affecting the shape of the summed spectrum shown in 
Fig. [I} . Given the complexities in determining resolved 
XRB fractions (e.g., adopting various methodologies and 
different XRB normalizations), the resolved XRB frac- 
tions reported in Table [2] (i.e., summed contributions of 
X-ray sources and bright-end correction that range from 
~ 75% to 80%) are in general agreement with those from 
other works. For example, Hickox & Markevitch (2006) 
found the resolved XRB fractions to be 77 ± 3% and 
80 ± 8% for the 1-2 and 2-8 keV bands, respectively; 
Lchmer et al. (2012) obtained resolved XRB fractions of 
76 ± 4%, 82 ± 13%, 88 ± 14%, and 82 ± 9% for the 0.5-2, 
2-8, 4-8, and 0.5-8 keV bands, respectively. 



2.1. Redshifts 

Sample A has a 5cr z-band limiting magnitude of 28.1, 
much deeper than most of the photometric-redshift cat- 
alogs available in this field (e.g., Cardamone et al. 2010; 
Rafferty et al. 2011), and thus 40% of the Sample A 
sources (most with z§50 > 26) have no photometric- 
redshift estimates in those catalogs. Recently, Dahlen 
et al. (2010) derived photometric redshifts (z p hot's) 
for the 32,508 GOODS-S z-band sources in the entire 
GOODS-S region, which include all the 18,035 Sample A 
sources. We chose to re-derive z p hot's for these 32,508 
sources in order to ensure consistency of our analyses 
here (i.e., using an optimized comprehensive set of spec- 
tral energy distribution templates throughout; see below) 
and also include the latest CANDELS HST/WFC3 pho- 
tometry. 

We used the ultradeep 12-band photometry and 1382 
secure spectroscopic redshifts (z spec 's) assembled by 
Dahlen et al. (2010; see their § 2.3 for how the qual- 
ity flag for z spcc was assigned). The 12-band photome- 
try, covering a wavelength range of ~ 0.3-8.0 /jm in the 
observed frame, consists of the VLT/VIMOS [/-band; 
HST/ ACS F435W, F606W, F775W, and F850LP bands; 
VLT/ISAAC J, H, and K s bands; and Spitzer/IRAC 
3.6, 4.5, 5.8, and 8.0/xm bands. The photometry (includ- 
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Table 1 

Stacked 6-8 keV Properties 









Median 


Median 


Median 


Net 


S/N 


Band 




Total 


Resolved 


Sample 


Criteria 




z 


Af* (M ) 


C e ff 


Counts 


w 


Ratio 


r a a 


Intensity 


Fraction (%) 


(i) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


(10) 


(11) ' 


(12) 


A 


All galaxies within 6' of aim point 


18,035 


1.10 


2.3 X 10 8 


-0.53 


819±343 


2.4 


0.44±0.08 


1.67±0.16 


1.27 


26.1±10.9 


B 


Mir > 2 X 10 s M Q 


9542 


1.48 


8.9 x 10 8 


-0.45 


841±250 


3.4 


0.46±0.07 


1.63±0.13 


1.32 


26.9±8.0 


C 


-0.85 < C aS < 


12,290 


1.13 


2.6 x 10 8 


-0.51 


799±283 


2.8 


0.47±0.10 


1.62±0.20 


1.24 


25.4±9.0 


D 


M, > 2 x 10 s Af e & -0.85 < C cff < 


6845 


1.59 


8.1 x 10 8 


-0.48 


820±212 


3.9 


0.48 ± 0.08 


1.60 ±0.16 


1.28 


26.2±6.8 



Note. — Col. (1): Sample of galaxies used for stacking. Samples B-D are subsets of Sample A. Col. (2): Criteria used to define the 
stacked sample (see §[3] for the definition of C e g). Col. (3): Number of galaxies in the stacked sample. Col. (4): Median redshift of the 
stacked sample. Col. (5): Median stellar mass of the stacked sample. Col. (6): Median effective rest-frame color of the stacked sample. 
Col. (7): Stacked net source counts in the 6—8 keV band, with lcr Gaussian statistical errors. Col. (8): Stacked signal-to-noise ratio in 
the 6-8 keV band. Col. (9): Stacked band ratio, defined as the ratio between the observed 2—8 keV and 0.5—2 keV count rates. The 
lcr errors were calculated following the "numerical method" described in §1.7.3 of Lyons (1991). Col. (10): Effective photon index with 
lcr errors of the stacked sample. Col. (11): Total 6-8 keV intensity (in units of 10 — 12 erg cm -2 s — 1 deg — 2 ) of the stacked sample. We 
calculated effective photon indices and fluxes based on band ratios and aperture-corrected count rates using the CXC's Portable Interactive 
Multi-Mission Simulator. Col. (12): Resolved fraction of the 6—8 keV XRB from the stacked sample. We adopted the XRB normalization 
from Hickox & Markevitch (2006); see §[T] 



Table 2 

Resolved XRB Fractions in Various Bands 





0.5-1 keV 


1-2 keV 


2-4 keV 


4-6 keV 


6-8 keV 


Sample 


Resolved Frac. (%) 


Resolved Frac. (%) 


Resolved Frac. (%) 


Resolved Frac. (%) 


Resolved Frac. (%) 


(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


A 


9.2 ± 0.8 


7.6 ±0.4 


5.8 ±0.9 


< 6.0 


26.1 ± 10.9 


B 


7.9 ± 0.6 


6.8 ±0.3 


4.8 ±0.7 


< 4.4 


26.9 ±8.0 


C 


5.7 ±0.7 


5.0 ±0.3 


3.3 ±0.8 


< 4.9 


25.4 ±9.0 


D 


5.0 ±0.5 


4.6 ±0.3 


3.1 ±0.6 


< 3.7 


26.2 ±6.8 


X-ray sources 


21.0 ±5.3 


26.5 ±5.1 


38.1 ± 5.2 


43.4 ±5.2 


47.7 ±5.5 


Bright-end correction 


58.9 ±4.2 


47.8 ±3.4 


38.9 ±2.7 


33.1 ± 2.3 


29.8 ±2.1 


X-ray src. + corr. + D 


84.8 ±6.7 


78.9 ±6.1 


80.1 ± 5.9 


76.5±|f 


103.7 ±9.1 



by X-ray point sources in the 4 Ms CDF-S and the corresponding bright-end correction (Luo et al. 2011; also see §0. The last row is 
the sum of contributions from X-ray point sources, the bright-end correction, and Sample D. Cols. (2-6): Resolved XRB fractions and lcr 
uncertainties in various bands. In Col. (5), the "<" signs for Samples A, B, C, and D indicate 3<r upper limits on resolved fractions in 
the 4-6 keV band, where the upper limit for Sample D was used to determine the upper error of the total contribution in this band. We 
adopted the XRB normalization from Hickox & Markevitch (2006); see §[T] 



ing upper limit J^T) was obtained using the TFIT algo- 
rithm that performs point-spread-function matched pho- 
tometry uniformly across different instruments and filters 
(see Dahlen ct al. 2010 for details). Additionally, we 
also included the latest photometry in the HST/WFC3 
F105W (Y), F125W (J), and F160W (H) bands, based 
on the first 10-epoch GOODS-S images from CANDELS 
(Grogin et al. 2011; Koekemoer et al. 2011) that are 
publicly available. For the CANDELS Deep and Wide 
regions that have multi-epoch J and H coverage, the 
images were stacked for each band in each region us- 
ing the published weight maps by the CANDELS group. 
Object catalogs were generated using SExtractor (Bertin 
& Arnouts 1996) version 2.8.6, and then source match- 
ing was performed with the Dahlen et al. (2010) cata- 
log using the SExtractor ASSOC option, searching for 
the nearest match within 0.5". The J-band photometry 
was done with the SExtractor single-image mode, while 
the iY-band photometry was extracted with the dual- 
image mode to match detections in J. The photometry 
on Y-band images was extracted only using the single- 
image mode because of its different sky coverage from 
J and H images. For sources that have J and H de- 

14 For photometry reported by TFIT with a negative flux value 
or a positive flux value that is equal or smaller than its error <jf (i.e., 
with a < 1 signal-to- noise ratio), we incorporate this information as 
a flux upper limit (i.e., with coverage but no detection) by setting 
both the values of flux and its error to be <7f . 



tections in both the FST/WFC3 and VLT/ISAAC, we 
found the two sets of J and H measurements in reason- 
able agreement without any apparent systematic offsets; 
for these sources, we chose to adopt the measurements 
from HST/WFC3 and discard those from VLT/ISAAC 
because the HST/WFC3 photometry is of higher quality. 
The 5fj limiting magnitudes for point sources are the fol- 
lowing: 28.0 for the VIMOS [/-band; 28.7, 28.8, 28.3, and 
28.1 for the ACS F435W, F606W, F775W, and F850LP 
bands; 25.0, 24.5, and 24.4 for the ISAAC J, H, and K s 
bands; 26.1, 25.5, 23.5, and 23.4 for the IRAC 3.6, 4.5, 
5.8, and 8.0/im bands (see § 2.1 of Dahlen et al. 2010 for 
details); and 28.2, 27.9, and 27.6 for the WFC3 F105W, 
F125W, and F160W bands (see Table 6 of Grogin et al. 
2011 for details), respectively. 

Utilizing the Zurich Extragalactic Bayesian Redshift 
Analyzer (ZEBRA; Feldmann et al. 2006), we adopted 
a procedure similar to that detailed in Luo et al. (2010) 
and Rafferty et al. (2011) to calculate Zphot's down to 
z 85 o 28.1. As is standard practice, we constructed 
our galaxy spectral energy distribution (SED) templates 
based on the stellar population synthesis model by 
Bruzual & Chariot (2003) with a Chabrier initial-mass 
function (IMF; Chabrier 2003) and a dust-extinction law 
from Calzetti et al. (2000). The adopted star- formation 
history is of exponential form, e~'/ T , with log(r/year) 
ranging from 6.5 to 11.0 and log(age/year) ranging from 
7.0 to 10.1 (both in steps of 0.1). The dust extinction 
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Figure 1. Resolved XRB fractions in five energy bands between 0.5 and 8 keV; this figure illustrates the values in Table [2] (details are 
therefore referred to Table [2] cf. Fig. 6 of Luo et al. 2011). The data points are shifted slightly in the x-direction for clarity. The total 
XRB intensities are adopted from Hickox & Markevitch (2006; also see § [TJ with uncertainties indicated by the gray area. 



Ay varies between and 3.0 (also in steps of 0.1), and 
the metallicities are Z = 0.004, 0.008, 0.02 (roughly so- 
lar), and 0.05. Using the available 1382 secure z spec 's 
(redshifts were fixed to the z apcc values for training pur- 
poses), we first ran ZEBRA to identify and apply sys- 
tematic offsets in the photometry (differing from filter to 
filter; typically < 0.3 mag) that minimized the residuals 
between observed and best-fit template fluxes. We then 
used ZEBRA to construct new templates by modifying 
the original templates based on the best fits between the 
corrected photometry and original templates. Finally we 
ran ZEBRA on all 32,508 sources, using the corrected 
photometry and an improved set of templates, to derive 

Zphot S. 

We used the normalized median absolute deviation 



cnmad = 1-48 x median 



Az — mcdian(Az) 



1 + z. 



spec 



. (1) 



where Az = z p hot — z spoc , and the outlier fraction [out- 
liers are defined as sources with |Az/(l + z spC c)| > 0.15] 
to assess the z p hot quality. For the spectroscopic sub- 
sample, we find ctnmad = 0.005 and an outlier fraction 
of 1.8%. However, the above evaluation cannot repre- 
sent the real quality of the z p hot's, because the SED 
templates were modified using the z spcc information, and 
we are thus biased to get optimal fitting results for the 
spectroscopic subsample. Therefore, to obtain a realis- 
tic estimate of the z p hot quality for the sources lacking 
^spec's (i.e., no training possible), we performed a series 
of blind tests. We randomly selected 3/4 of the z spcc 
sources to go through the above training procedure (i.e., 
photometry correction and template improvement). We 
then derived z p hot's for the remaining 1/4 z spcc sources 
(i.e., blind-test sources whose z spcc information was not 



utilized) based on the corrected photometry and the ex- 
panded set of templates. The blind test was repeated 
ten times to ensure a statistically meaningful assessment, 
which means that there are duplicated blind-test sources 



because a 



'spec 



source will on average be used for blind 



testing 2.5 times. Figure [5] shows the z p hot quality results 
from the blind tests. We obtained ctnmad = 0.043 and 
an outlier fraction of 7.1% for the blind tests. 

We defined c^ MAD and ctnmad *° examine further the 
•Zphot accuracy as a function of Zsso magnitude, where 
°nmad ^ s calculated for sources with z p hot > z spec using 
Eq. [T] and c^mad f° r sources with z p hot < z spec . The 
""nmad an d "nmad running curves are shown in Fig. [2^ 
(red and blue curves) , both of which are roughly constant 
(« ±0.03) and symmetric around the Az/(1 + z spcc ) = 
axis (dashed line) across a wide range of zsso magnitude. 
Therefore, our z p hot quality appears to be reasonably 
accurate and free of strong systematics down to faint 
magnitudes, as can also be inferred from Figs. [2}d-[2J1 that 
show the histograms of Az/(l + z spoc ) in various intervals 
of Z850 magnitude. 

Strictly speaking, the above blind-test analysis of z p hot 
quality is really only applicable for Zsso ^25.2 mag (the 
rough limit of the spectroscopic data available), given 
that some of the bandpasses used in the analysis have 
dropped toward fainter magnitudesEB To explore effec- 
tively the true behavior of z p hot quality at fainter magni- 
tudes, we performed four additional series of blind tests 

15 We examined the mean number of detection bands (Nfn teI ) 
as a function of zgso magnitude for all the 32,508 sources in the 
Dahlen et al. (2010) catalog (cf. Fig. 8b of Luo et al. 2010). We 
find that Nfn teI is no less than « 10 for sources with zgso < 25 mag, 
while Alitor drops from S3 10 to ss 7 as zgso goes from i=a 25 mag 
to 28 mag. 
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Figure 2. Blind-test results of photometric redshifts for all sources with z spe c (i.e., including duplicate ones), (a) Az/ (I + z Bpcc ) as a 
function of z-band magnitude. The cri. 



blue curves, respectively, (b— d) Histograms of Az/(1 



and o"j^ MAD running curves (both computed in bins of Azgso = 1 mag) are shown as red and 
cc) in various intervals of zsso magnitude, with corresponding values of o"J MAD 
and c^ MAD annotated. In each of the four panels (a-d), the number of all sources is shown without parentheses and the number of unique 
sources (i.e., excluding duplicate ones) is shown in parentheses. The dashed line indicates Az/(l + z spcc ) = 0, and the dotted lines indicate 
the threshold values of outliers [i.e., Az/(1 + z spe c) = ±0.15]. 



(denoted as cases i-iv) that are almost identical to the 
previous blind tests with the only difference being the uti- 
lization of "faked" photometry in four different ways. We 
faked the photometry of the z spcc subsample as follows. 



of a 



NMAD/ "NMAD 



For each 



'spec 



source, we first randomly picked a faint 



(i.e., Z850 > 25 mag), non-z spec source either from the 
Dahlen et al. (2010) catalog (a total of 32,508 sources) 
or from Sample D (a total of 6845 sources, whose stacked 
6-8 keV emission can account entirely for the unresolved 
« 20-25% of the 6-8 keV XRB; see §[3] for details). We 
then applied the band coverage of the randomly picked 
Zphot source to the z spec source. Specifically, for each fil- 
ter considered, (1) if the z p hot source was not observed, 
we then set the z spcc source as non-observed; (2) if the 
•Zphot source was not detected (i.e., upper limits applied), 
we then either set the z spcc source as non-observed (this 
corresponds to the worst scenario where all information 
was discarded) or set the z spoc source as non-detected 
(i.e., we added a random la fluctuation to the flux of 



the 



'spec 



source to simulate the photometric quality of 



the z p hot source and treated the derived flux as an up- 
per limit); and (3) if the z p hot source was detected, we 
then did nothing with the photometry of the z spec source. 
The various combinations of parent sample (the Dahlen 
et al. catalog vs. Sample D) and photometry treatment 
of the S pec source (non-observed vs. non-detected, when 
the Zphot source was not detected) lead to cases i-iv. 
Table [3] shows the z p hot quality blind-test results for 
cases i-iv where "faked" photometry was utilized. As 
an example, Figure [3] shows the results for case iv. Over- 
all, the obtained ctnmad ranges from 0.049 to 0.055 (cf. 
cnmad = 0.043 obtained in previous blind tests) and the 
outlier fraction ranges from 10.4% to 13.1% (cf. an out- 
lier fraction of 7.1% obtained previously). These analyses 
suggest that in our case the reduction of bandpass cov- 
erage at Z850 > 25 mag does degrade the z p hot quality to 
some degree, which is expected, but not severely overall. 

We also made different versions of Figs. [5] and [3] using 
only z spcc > 1 sources. We find that these versions resem- 
ble the original Figs. [5] and [3] closely in terms of values 



and outlier fractions. This analysis 
shows that there is no apparent degradation of our z p hot 
quality toward high redshifts. 

We then compared our z p hot's with other photometric- 
redshift catalogs in this and other fields. In general, our 
z P hot quality (in terms of ctnmad and outlier fraction) is 
consistent with that of Cardamone et al. (2010), Dahlen 
et al. (2010), Luo et al. (2010), Rafferty et al. (2011), 
and Salvato et al. (2011) at similar magnitudes (the 
first four catalogs have CDF-S coverage while the fifth 
one is in the COSMOS field). Further source-to-source 
comparison with any of the four CDF-S catalogs re- 
veals no strong systematic difference in the z p hot's at 
any magnitudes, and the typical difference in the z p hot 
estimates is at the same level as the reported error bars 
[median(|z photi xue - z P hot,other|) ~ er zphot ~ 0.15]. 

Overall 779 out of the 18,035 sources (4.3%) in Sam- 
ple A have secure z spoc 's, while Sample D contains 
537 sources (7.8%) with a secure z spcc out of its total 
6845 sources. For Sample D sources, we find accept- 
able agreement between our z p hot's and those of Dahlen 
et al. (2010) (ctnmad = 0.080), despite the different 
methodologies adopted and the challenging nature of de- 
riving z p hot 's for these faint sources (see §[3J). 

2.2. Rest-Frame Absolute Magnitudes 

We followed the procedure in § 3.1 of X10 to derive 
rest-frame absolute magnitudes for each Sample A source 
up to its reddest rest- frame detection band (A"-, H-, 
J-, 7-, R-, V-, or S-band). Briefly, we adopted the 
approach of template SED fitting, which has the ad- 
vantage of potentially reducing catastrophic failures in 
cases of limited/incomplete photometric coverage, as op- 
posed to the approach of linear or log-linear interpola- 
tion/extrapolation based on photometric data. With the 
input of the aforementioned Dahlen et al. (2010; 12-band 
photometry) and CANDELS (Y, J, and H) photometry 
as well as the improved set of templates (see § 12.11) , we 
utilized ZEBRA to identify the best-fit template for each 
source by fixing the source redshift to the corresponding 
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Table 3 

Zphot Blind- Test Results with the Utilization of Faked Photometry 



Case 


Parent Sample 


Photometry Treatment 


CNMAD 


Outlier Fraction 


i 


Dahlen et al. catalog 


Non-observed 


0.055 


13.1% 


ii 


Dahlen et al. catalog 


Non-detected 


0.053 


11.5% 


iii 


Sample D 


Non-observed 


0.051 


11.3% 


iv 


Sample D 


Non-detected 


0.049 


10.4% 



0.4 



0.2 



0.0 



-0.2 



-0.4 



(a) all 

3450 sources 
(1 292 sources) , 



(b) z 850 <24 
2851 sources 
( 1 066 sources) 




(c) 24<z 850 <25 
463 sources 
(1 77 sources) 



(d) z 850 >25 
1 36 sources 
(49 sources) 




□ 



16 18 20 22 24 26 
z-band (ACS F850LP) magnitude 



100 300 500 20 60 100 5 15 25 
N N N 



Figure 3. Same as Figure [2] but derived with the utilization of faked photometry (corresponding to case iv; see § 12.11 1. 



°spcc 



(if available) or z p hot- We then derived rest-frame 
absolute magnitudes for each source based on the best-fit 
template. Dust extinction is folded into our galaxy SED 
templates (see § 12.11) , so the derived rest- frame absolute 
magnitudes are not extinction corrected. 



2.3. Stellar Masses 

We adopted the approach described in § 3.2 of 
X10 to derive stellar masses (M*) for the Sample A 
sources. Using the tight correlations between rest- frame 
optical/near- infrared colors and stellar mass-to- light ra- 
tios obtained by Zibetti et al. (2009), 

log(M Ai */M ) = log(L A /L A , Q )+6 A (M B -A/v)+aA+0.20, 

(2) 

we estimated a set of stellar masses at various rest-frame 
bands (denoted as A) for each source (the values of the 
coefficients, a A and b\, can be found in Table Bl of Zi- 
betti et al. 2009). The above equation was derived by 
constructing spatially resolved maps of stellar-mass sur- 
face density in galaxies, based on the high-quality opti- 
cal and near-infrared imaging data of a sample of nine 
nearby galaxies that span a broad range of morphologies 
and physical properties (Zibetti et al. 2009). When de- 
riving Eq. [2J Zibetti et al. (2009) took into account the 
effects of dust in their models. Thus, L\ and Mb — My in 
Eq. [2] are the observed (dust-extincted) luminosity and 
rest-frame color. A caveat pointed out by Zibetti et al. 
(2009) is that stellar masses of dusty starburst galaxies 
estimated using unresolved photometry are likely under- 
estimated by up to 40% because dusty regions are under- 
represented in the measured fluxes. In Eq. [5] we have 
adjusted the normalization by 0.20 dex to account for 
our adopted Salpeter (1955) IMF for stellar-mass esti- 



mates{3 We selected the stellar-mass estimate that cor- 
responds to the actual reddest rest-frame detection band 
of the sourc j 17 ! because longer-wavelength (e.g., K-band) 
galaxy luminosities are much less sensitive to dust and 
stellar-population effects than shorter-wavelength lumi- 
nosities (e.g., Bell & de Jong 2000). 

Using simulations, we assessed the uncertainties associ- 
ated with stellar-mass estimates that arise from our pro- 
cedure for deriving photometric redshifts and rest-frame 
absolute magnitudes based on template SED fitting. For 
each Sample A source, which has a photometric redshift 
^phot, i and an associated 1-sigma error <7 p hot,i) we ran- 
domly drew a value z S j m ,i (i.e., the simulated photometric 
redshift) from the range of Zphot.i ± fphot.i conservatively 
assuming a uniform distribution. We then derived the 
simulated rest-frame absolute magnitudes for the source 
using Zsim.i, the aforementioned photometry, and the im- 
proved set of templates (see § 12. ip . following the pro- 
cedure detailed in § 12.21 Finally, we obtained a simu- 
lated stellar-mass estimate for the source using Eq. [3J 
For Sample A sources, we found no systematic offset be- 
tween the set of simulated stellar-mass estimates (M* jS i m ) 
and the set of real stellar-mass estimates (M*), i.e., the 
distribution of the logarithmic ratio between these two 
sets of stellar-mass estimates [R — log(M* !S j m /M*)] is 
symmetric and peaks at zero; furthermore, the scatter 
of R is 0.22 dex and largely independent of stellar mass. 
Given that the photometric-redshift errors derived with 

16 In this paper, we have adopted a conversion factor of 
fa 0.20 dex (i.e., tts 1.6) between stellar masses estimated using a 
Salpeter IMF and a Chabrier IMF (with the former stellar masses 
being systematically larger). 

17 Of the Sample A (Sample D) sources, 94.7%, 79.5%, and 
57.0% (94.2%, 83.2%, and 59.8%) have rest-frame ij-band, J-band, 
and -K-band detections (detection indicates a > la signal; see Foot- 
note [T4j or beyond, respectively. 
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ZEBRA generally underestimate the true errors by a fac- 
tor of « 3-6 (see, e.g., § 3.4 of Luo et al. 2010), we 
repeated the above simulation four times by randomly 
drawing z s i m! ; from the range of z p hot,i ± ^cr p hot,i (where 
n = 3, 4, 5, and 6) assuming a uniform distribution. In 
these four additional simulations, we also found no sys- 
tematic offset between M* s i m and M* values; the scatter 
of R is 0.40, 0.45, 0.50, and 0.56 dex for n = 3, 4, 5, and 
6, respectively. We expect to have smaller scatters in 
R if we randomly draw z s j m! i assuming a Gaussian dis- 
tribution that peaks at 2 p hot,i> which is likely closer to 
reality. The above analyses show that the stellar-mass 
errors produced by the uncertainties of photometric red- 
shifts and rest-frame absolute magnitudes are typically 
smaller than w 0.2-0.5 dex. 

We assessed the robustness of our stellar-mass esti- 
mates through several checks. First, we compared our 
stellar-mass estimates with those presented by X10 and 
Mullaney et al. (2012). For all sources in X10 and Mul- 
laney et al. (2012), we find general agreement between 
common sources, with a median ratio of « 1.0 between 
the two estimates (after taking into account different 
choices of IMFs and rest-frame bands that are used for 
stellar-mass estimates) and < 0.35 dex random scatter. 
Second, we compared our galaxy stellar-mass distribu- 
tions to those in the COSMOS field. Ilbert et al. (2010) 
computed the stellar masses of the COSMOS galaxies 
where sources with i + < 25.0 have the most reliable 
photometric redshifts and mass estimates. For each of 
the chosen Subaru i + -band limiting magnitudes (i.e., 
i+ m . t = 22.5, 23.0, 23.5, 24.0, 24.5, and 25.0)0 the distri- 
butions of our stellar masses and the Ilbert et al. (2010) 
stellar masses are generally similar, with comparable me- 
dian stellar masses and Kolmogorov-Smirnov test proba- 
bilities ranging from 8.0% to 49.4% that indicate similar 
stellar-mass distributions. Third, we compared our color- 
mass distribution with that in Peng et al. (2010). Fig- 
ure 2] is our color-mass diagram, which is in parallel with 
the two bottom panels in Fig. 4 of Peng et al. (2010). The 
color-mass bimodality feature and the distribution of the 
sources in the color- mass plane in our Fig.|4]are very simi- 
lar to those in Fig. 4 of Peng et al. (2010). Finally, we also 
estimated stellar masses utilizing the Fitting and Assess- 
ment of Synthetic Templates (FAST; Kriek et al. 2009) 
package that is based on galaxy SED fitting. We adopted 
the same stellar population synthesis model, IMF, dust- 
extinction law, star-formation history, and metallicity 
ranges as those described in § 12.11 to ensure consistency 
between estimates of z p hotj rest-frame absolute magni- 
tudes, and stellar masses. We found that stellar masses 
calculated by FAST are consistent with those based on 
Zibetti et al. (2009) after taking into account different 
choices of IMFs, with an RMS of < 0.4 dex, which is 
the typical precision of such methods. Throughout this 
paper, we have chosen to adopt stellar masses based on 
Zibetti et al. (2009) that are more directly related to 
source colors and rest-frame absolute magnitudes and 

18 For each Sample A source, we utilized a JC-correction pack- 
age (kcorrect.v4_l_4; Blanton & Roweis 2007) to convert the 2850 
magnitude into the i + magnitude by convolving the best-fit SED 
template of the source with the zgso and i + filter curves and com- 
puting the differences between the derived zg$o and i+ magnitudes 
(typically |z 850 - i + \ < 0.5 mag). 
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Figure 4. Color-mass distribution plots, for direct comparison 
with the two bottom panels in Fig. 4 of Peng et al. (2010) to 
evaluate the validity of our mass estimates. The y-axis is rest-frame 
U — B color (converted into the AB magnitude system), and the 
x-axis is the logarithm of our mass estimate reduced by 0.2 dex to 
adjust for the offset between our IMF (Salpeter) and the Chabrier 
IMF adopted in Peng et al. (2010). The solid line is the division 
between red and blue galaxies used by Peng et al. (2010), which 
is a function of U — B, mass, and redshift (see their Eq. 2). The 
sources used in these two panels are in the same redshift intervals 
as used in the two Peng et al. (2010) panels, and we also applied 
a magnitude cut of i < 22.5 since their catalog is flux-limited at 
/ < 22.5. 



thus less dependent on model and parameter choices. We 
have verified that the same basic results presented below 
can be obtained by using stellar masses calculated with 
FAST. 

3. ANALYSIS AND RESULTS 

In Table 0] we present the derived source properties 
of the 18,035 Sample A sources. As discussed in § [1] 
mounting evidence has shown that luminous AGNs tend 
to reside in massive (i.e., M* > 10 10 M ) and red galax- 
ies over at least the last ~ 80% of cosmic history, i.e., 
z k, 0-4 (e.g., Barger et al. 2003; Bundy et al. 2008; 
Brusa et al. 2009; Silverman et al. 2009; X10; Mullaney 
et al. 2012). Therefore, we utilize these mass and color 
constraints as clues to search for the underlying popu- 
lation of luminous but highly obscured AGNs that are 
responsible for the unresolved rj 20-25% of the 6-8 keV 
XRB. 

Figure [5^ shows the effective color vs. mass diagram 
for Sample A (shown as a density map overlaid with con- 
tours), where the effective color is defined as 

C cS = (U- V) iest + 0.312 + 0.08M V + 0.51. (3) 

In Eq. [3l (U — V) Tes t is the rest-frame U — V color (i.e., 
Mtj — My), z is the redshift, and My (Mtj) is the rest- 
frame absolute U-band ([/-band) magnitude. The defini- 
tion of C e ff is based on the equation separating galaxies 
into the red sequence and the blue cloud of Bell et al. 
(2004), who studied the color distribution of « 25,000 
R < 24 galaxies with 0.2 < z < 1.1. Taking into account 
a typical color scatter of < 0.2 mag for the red-sequence 
color-magnitude relation (see § 4 of Bell et al. 2004), the 
Bell et al. equation separates blue and red galaxies rea- 
sonably well down to fainter magnitudes out to z sa 3-4 
(e.g., X10), and thus we use C c g to define whether a 
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Table 4 

Derived Properties for the Sources in Sample A 



No. 


RA 


DEC 


•^spec 


-^phot 


^•ph,low 


■^ph,up 


Mtj 


Mb 


M v 


-^reddest 


^reddest 


log(M*/M ) 


(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


(10) 


(11) 


(12) 


(13) 


9000 


53.11829 


-27.86707 


-1.000 


1.620 


1.548 


1.707 


-17.88 


-17.98 


-18.42 


-20.15 


6 


8.88 


9001 


53.11829 


-27.84127 


-1.000 


0.610 


0.594 


0.670 


-15.07 


-14.97 


-15.59 


-19.27 


7 


8.60 


9002 


53.11832 


-27.81530 


-1.000 


0.550 


0.533 


0.665 


-15.67 


-14.98 


-15.21 


-17.07 


7 


7.26 


9003 


53.11834 


-27.72375 


-1.000 


0.395 


0.362 


0.613 


-14.85 


-15.07 


-15.63 


-17.97 


7 


8.02 


9004 


53.11835 


-27.70741 


-1.000 


0.750 


0.605 


0.863 


-15.94 


-15.77 


-15.90 


-16.42 


4 


7.60 



Notes. The full table contains 18,035 entries and 20 columns for each entry. Columns: (1) Source sequence number (from 1 to 18035). 
(2, 3) J2000 right ascension and declination (in degrees). (4) Spectroscopic redshift (—1.000 indicates no spectroscopic redshift 
available). (5) Photometric redshift. (6, 7) 1-cr lower and upper limits on photometric redshift. (8, 9, 10) Rest-frame absolute 
U-, B-, and V-band magnitude (Vega mags). (11) Rest-frame absolute magnitude (Vega mags) that corresponds to the reddest 



rest-frame coverage. The conversion between Vega and AB rest-frame absolute magnitudes is: A/ab 



Mvo, 



where 



m conv = 0.628/ - 0.102/0.029/0.264/0.501/0.914/1.381/1.839 for rest-frame U/B/V/R/I/ J/H/K-hanA, respectively; we derived these 
Ticonv values using a K-correction package (kcorrect.v4_l_4; Blanton & Roweis 2 007) . (12) Flag of reddest rest-frame coverage. 
This flag shows which stellar-mass estimate is adopted as the final estimate (see § 12.30 . Frcddcst = (1, 2, 3, 4, 5, 6, 7) means that 
M-eddest = ^b/v/R/i/J/h/k ant l ^B/v/R/l/j/H/K,* i g adopted, respectively. (13) Stellar-mass estimate adopted in this paper. (14) 
Stellar-mass calculated by FAST. (15, 16) CANDELS V-band magnitude and associated 1-cr uncertainty (AB mags; —99.00 indicates 
no photometry available for this filter). (17, 18) CANDELS J-band magnitude and associated 1-cr uncertainty (AB mags). (19, 20) 
CANDELS //-band magnitude and associated 1-cr uncertainty (AB mags). 

(This table is available in its entirety in a machine-readable form in the online journal. A portion is shown here for guidance regarding its 
form and content.) 

der et al. 2011)E3 As expected, the vast majority of 
these sources are massive and on the red sequence, the 
green valley, or the top of the blue cloud. 

We then proceeded to stack different sub-groups of 
Sample A to investigate which sources produce the ma- 
jority contribution to the unresolved ~ 20-25% of the 
6-8 keV XRB. We adopted the same stacking procedure 
as detailed in § 3.1 of Luo et al. (2011). Briefly, to- 
tal counts (including background) for each individual 
source were extracted from an aperture 3" in diame- 
ter centered on its optical position. Background counts 
for each source were estimated by taking the mean of 
the counts within 1000 apertures (also with 3" diam- 
eter each), which were randomly placed within a 1'- 
radius circle around the source avoiding any known X-ray 
source (i.e., outside of twice the 0.5-2 keV 90% encircled- 
energy aperture radius of any 4 Ms CDF-S main-catalog 
source). Stacked counts [total (S) or background (£?)] 
were the summation of counts from the stacked sample 
with proper aperture correction applied. The net source 
counts are then given by S—B, and the S/N is calculated 
as (S — If)/*/!! where Gaussian statistics are assumed 
given the large values of S and B. 

Motivated by the fact that most of the obscured AGNs 
(and AGN candidates) appear in the massive and rela- 
tively red corner of Fig. [5^, we divided Sample A into var- 
ious stellar-mass and effective-color bins and stacked the 
sources in each bin. Figures. [6^ andEb show the stacking 
results, and Table [T] gives some detailed stacking results 
(e.g., stacked net counts, stacked signal-to-noise ratio, 
effective photon index, and resolved 6-8 keV XRB frac- 
tion) for some cases of interest. It seems clear that the 
ps 20-25% unresolved 6-8 keV XRB mostly lies in galax- 
ies with Al* > 2 x 10 8 M Q (i.e., Sample B; see Tabled]), 
in particular in the bin of 2 x 10 8 < M*/M < 2 x 10 9 



galaxy in Sample A is red or no10 given that the sources 
in Sample A span a wide range in redshift and luminos- 
ity. As shown in Fig. [5^, there is a correlation between 
stellar mass and effective color (with large scatter) such 
that more massive galaxies are generally redder (i.e., hav- 
ing larger C e ff values), consistent with previous results. 
The normalized histograms of stellar mass and effective 
color are shown in Figs.[5jD and[5j; (black histograms), re- 
spectively. For comparison, three additional samples of 
highly obscured AGNs or AGN candidates are also plot- 
ted on Fig. [5ji. The sample labeled with dark-green filled 
circles consists of 47 highly obscured AGNs at z f=s 0.5-3 
detected in the 6-8 keV band in the central 6' area of 
the 4 Ms CDF-SE3 (their mass and color histograms are 
shaded in dark green in Figs.[5jD andEh; Xll). The points 
labeled with crosses are a sample of 23 highly obscured 
AGN candidates at z « 0.5-1 that were X-ray undetected 
and selected by their infrared star-formation rate (SFR) 
excess (i.e., infrared-based SFRs being a factor of >3.2 
higher than SFRs determined from the UV after correct- 
ing for dust extinction; Luo et al. 2011). The points 
labeled with diamonds are a sample of 11 AGNs at z s» 2 
that were K < 22 BzK-selected galaxies and identified 
as highly obscured using the 4 Ms CDF-S data (Alexan- 



19 Galaxies in the red sequence, the green valley, and the blue 
cloud have C aS > 0.05, -0.05 < C cft < 0.05, and C eS < -0.05, re- 
spectively, given that 0.05 is the typical "half-width" of the green 
valley in a color-magnitude diagram (e.g., Nandra et al. 2007; 
X10). By definition, we would expect the C G ff distribution to be 
double-peaked (i.e., red and blue peaks), which is, however, not 
clearly seen in Fig. [5}: (black histogram) due to the dilution caused 
by color errors, uncertainties in redshift estimates, and large num- 
bers of low-mass blue galaxies. Indeed, if we consider only, e.g., 
galaxies with < z < 1 and M* > 10 9 5 Mq (as in X10), then 
color bimodality is clearly seen. 

20 These 47 AGNs are the 4 Ms CDF-S main-catalog sources 
that have an effective photon index of T e ff < 1.0 and satisfy a 
binomial-probability source-selection criterion of P < 0.004 in the 
6-8 keV band. In Xll, the P < 0.004 source-detection criterion 
was applied only in the 0.5—8, 0.5—2, and 2—8 keV bands; here we 
extended the use of this criterion for 6—8 keV source detection. 



21 The majority of the highly obscured AGNs mentioned here 
have Lrj.5-8 keV < 10 43 7 erg s — 1 , which indicates that their hosts 
dominate the optical-to-near infrared emission thus ensuring reli- 
able estimates of host stellar masses and colors (see § 4.6.3 of X10 
for details). 
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Figure 5. (a) Effective color-mass diagram for Sample A, which is shown as a density map overlaid with contours (the 8 contour levels 
are 3, 6, 9, 12, 15, 18, 21, and 24 sources per pixel). The large red rectangle highlights the region occupied by the Sample D sources. The 
two parallel, diagonal, long-dashed lines enclose a subsample of sources (discussed in § [4jl that lie within a diagonal stripe. For comparison, 
a sample of 47 highly obscured AGNs detected in the 6—8 keV band in the 4 Ms CDF-S (large dark green filled circles; Xll), a sample of 
23 X-ray undetected and infrared-selected highly obscured AGN candidates (crosses; Luo et al. 2011), and a sample of 11 highly obscured 
AGNs that were K < 22 i?2i\-selected galaxies (diamonds; Alexander et al. 2011) are also plotted (see §|3}. The division scheme of the 
red sequence, the green valley, and the blue cloud is illustrated on the left side, (b) Normalized (peaking at unity) stellar-mass histograms 
for Sample A (black histogram), Sample D (red shaded histogram), and the sample of 47 highly obscured, 6-8 keV detected AGNs (dark 
green shaded histogram; for clarity, we do not show the histograms for the aforementioned 23 highly obscured AGN candidates and 11 
highly obscured AGNs). (c) Same as Panel (b), but for normalized effective-color histograms. 



where a 2.8a signal was obtained. Moreover, the signal 
also mostly arises from the Sample A sources on the top 
of the blue cloud, i.e., the unresolved 6-8 keV XRB has 
major contributions from galaxies with —0.85 < C c ff < 
(i.e., Sample C; see Table [1]), in particular in the bin of 
—0.45 < Ccff < where a 2.9c signal was obtained. 

By applying both the mass and color constraints to- 
gether (i.e., selecting the common sources in Sample B 
and Sample C), we obtained Sample D (i.e., M* > 
2 x 1O 8 M and -0.85 < C off < 0; see Tabled}. There are 
6845 galaxies in Sample D that can account entirely for 
the unresolved « 20-25% of the 6-8 keV XRB (see Ta- 
blet!]), and the stacked 6-8 keV signal from these galaxies 
is significant at the 3.9er level (corresponding to a chance 
of p = 5.1 X 10 -5 that the signal was generated by Pois- 
son noise). The region occupied by the Sample D sources 
in the effective color-mass diagram is highlighted with 
a large red rectangle in Fig. [5p,, and their normalized 
stellar-mass and effective-color distributions are shown 
in Figs. EJd and [5fc, respectively. Additional information 
about the stacking results for various samples can be 
found in Table [2] and Fig. [Q 

Figure [7] shows the stacked 0.5-8 keV spectrum for the 
Sample D sources (open circles), with the stacked, adap- 
tively smoothed 6-8 keV image shown as the inset. The 



facts that the 6-8 keV flux is significantly elevated (com- 
pared to the fluxes in the lower-energy bands) and that 
there is no detection in the 4-6 keV band suggest that 
highly obscured AGNs dominate the stacked spectrum 
at high X-ray energies (see § 15.21 for more discussion) . 
The apparent inconsistency between the hard stacked 
spectrum (see Fig. [7|) and the relatively small band ratio 
(0.48, corresponding to T c g — 1.60; see Tablc[TJ) for Sam- 
ple D is due to the fact that the observed 2-8 keV count 
rate, which dilutes the 6-8 keV contribution, is used for 
the calculation. We stress that this observed spectral 
rise at the 6-8 keV band is not caused by our sample 
selection, based on the following analyses: (1) The ratio 
between the stacked 6-8 and 4-6 keV fluxes for the par- 
ent sample, i.e., Sample A, is larger than 9, meaning that 
the spectral rise observed in the stacked Sample D spec- 
trum is actually intrinsic to Sample A (see Fig. [T]); (2) 
We stacked Sample D and non-Sample D sources in the 
4-6 keV band respectively and do not find any difference 
between the average 4-6 keV fluxes for these two sam- 
ples; (3) We examined the resolved 4-6 keV XRB frac- 
tions for Sample A sources in various stellar-mass and 
effective-color bins (cf. Fig. [6]) and do not find any cor- 
relation or pattern between the resolved 4-6 keV XRB 
fractions and stellar masses/effective colors (unlike the 
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Figure 6. (a) Resolved 6—8 keV XRB fractions for Sample A 
sources in various stellar-mass bins. The number of sources (N) 
and the significance (in terms of cr) of the stacked signal in each 
stellar-mass bin are annotated accordingly. The horizontal dotted 
line indicates zero resolved 6-8 keV XRB fraction, (b) Same as 
panel (a), but for effective-color bins. Here the quoted significances 
are in general low due to the dilution of signal caused by sample 
splitting into many stacking bins (this also applies to Fig- [HJ - 

case of Fig. [6|), which suggests that, by applying our 
stellar-mass and effective-color cuts, we did not discard 
sources that have a relatively larger 4-6 keV flux. 

4. ROBUSTNESS OF STACKING RESULTS 

It is important to assess the robustness of our stacking 
results and the significance of our stacked signal. Our 
stacking strategy, i.e., selecting sources in M* and C e ff 
space, is strictly physically motivated, although the ex- 
act threshold values of M* and C e ff were chosen for a 
yield of strong signal. As a result, it is possible that 
the significance value of 3.9cx reported above is some- 
what overestimated. Therefore, we performed 1000 10- 
fold cross-validation tests (Efron & Tibshirani 1993; Ko- 
havi 1995; Davison & Hinkley 1997) to assess further the 
significance of our stacked signal. In each of the 10-fold 
cross-validation tests, we randomly split the data (i.e., 
Sample A) into ten subsamples of the same size. Taking 
one of the subsamples as the testing data, we used the 
rest of the subsamples as the training data upon which 
exploratory sample-selection criteria were utilized to find 
the best threshold values for M+ and C c s in a similar 
way to the construction of Sample These threshold 
values were then used for selecting sources from the test- 

22 We required that the grid in log M+-C e ff space used to find 
the threshold values be not finer than 0.3 dex for logM* or 0.05 
for C e ff, which are the typical errors on M+ or C c ff . 



ing subsample. The process was repeated with each of 
the ten subsamples being the testing data once, and in 
the end all the selected sources in these ten folds were 
combined for stacking, which provided one estimate of 
the significance of our stacked signal (<7 S ). The 10-fold 
cross-validation test was performed 1000 times, and a 
distribution for o~ s was obtained. The median value for 
a s is 3.3 (corresponding to a chance of p = 5.0 x 10~ 4 of 
the signal coming from pure Poisson noise). This value 
is likely to be slightly pessimistically biased because the 
effectiveness of the search for the best threshold M* and 
C c ff values is highly dependent on the training sample 
size, whereas in 10-fold cross validation tests 10% of the 
data were not used for training (Kohavi 1995). There- 
fore, the true significance of our stacked signal should be 
between 3.3<7 and 3.9c. 

We also performed several other robustness and consis- 
tency tests. For example, we stacked randomly selected 
sets of 6845 sources from Sample A. We repeated this 
procedure 10,000 times and find not a single case where 
the stacked 6-8 keV signal resolves more XRB or has 
higher significance compared to Sample D. This is con- 
sistent with our reported significance of (3.3-3.9)cr with 
p = (5.1-50) x 10~ 5 . We also stacked all the non Sam- 
ple D sources (i.e., 18035 — 6845 = 11190 sources) and 
find that the stacked 6-8 keV signal is, as expected, con- 
sistent with background (a O.ler signal). We furthermore 
investigated the effects of changes in the sample-selection 
criteria on our stacking results. We considered various 
combinations of stellar-mass and effective-color threshold 
values [e.g., M*, threshold varying between (l-5)xl0 8 M Q 
and the lower and upper C e g threshold values varying by 
±0.2, respectively] and obtained similar stacking results 
to that of Sample D in all cases. 

Finally, we have explored another purely physically 
motivated stacking strategy, which is formulated upon 
observation of the distribution of the three groups of ob- 
scured AGNs or AGN candidates in Fig. [5^. Given the 
locations of these obscured AGNs or AGN candidates in 
the figure, we selected sources lying within a diagonal 
stripe, 5.5 < 0.6 log M* — C c g < 7.0 (i.e., the region en- 
closed by the two parallel, diagonal, long-dashed lines 
in Fig. [5^). This stripe, running from the upper right 
corner to the lower left in Fig. [5^, is essentially the nar- 
rowest stripe containing all the obscured AGNs or AGN 
candidates except two "outliers" that are located around 
the C e ff ~ -0.4 and logM* > 11 area. There are 8415 
sources within this stripe, and they contribute 25.9±7.6% 
to the unresolved 6-8 keV XRB (a 3.5cr stacked signal). 



5. DISCUSSION 



5.1. 



General Properties of the Galaxies Hosting the 
Underlying Highly Obscured AGNs 



Given that the Sample D sources can account for the 
unresolved « 20-25% of the 6-8 keV XRB, it is of interest 
to determine what sources provide the majority contri- 
butions, i.e., what sources in Sample D are most likely 
to host "hidden" highly obscured AGNs. We therefore 
examined the resolved 6-8 keV XRB fractions for the 
Sample D sources in various redshift and z-band mag- 
nitude bins, as shown in Figs. [5^ and [5b, respectively. 
It appears that the galaxies with redshifts 1 < z < 3 
(see Fig. [8ji) and magnitudes z 85 o ~ 25-28 (see Fig. [8p) 
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Figure 7. Stacked X-ray spectrum (open circles) for the 6845 
sources in Sample D (the top x-axis shows the rest-frame photon 
energy at z = 1.6, which is the median rcdshift of the Sample D 
sources; see Table [TJ. The downward arrow in the 4—6 keV band 
indicates a 3<r upper limit. The solid curve is a schematic fit to 
the stacked X-ray spectrum, which is the sum of three components 
(each evaluated at z = 1.6): an unabsorbed power-law compo- 
nent accounting for star formation (dotted line; T = 2.0), a pure 
reflection component from the AGN (dashed curve), and a pure 
transmission component from the AGN (dashed-dot curve). Inset: 
Stacked, adaptively smoothed, 6-8 keV image, with the 3" diam- 
eter photometric aperture, the significance of the stacked signal, 
and the total stacked exposure shown. 
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vations. Marchesini et al. (2012) studied the rest-frame 
y-band luminosity function of galaxies at 0.4 < z < 4.0. 
Based on best-fit My values (for a Schechter luminos- 
ity function) in the different redshift ranges presented 
in their Table 2, we estimate that the above Sample D 
sources with z^q rs 25-28 at 1 < z < 3 typically have 
(0.05-0.10) Ly. Here we do not expect cosmic variance 
induced by large-scale structures (LSS) to affect the ba- 
sic redshift distribution observed in Fig. [5£i (thus affect- 
ing our basic results) in a significant way because of the 
following (also see § Q] for a brief discussion of cosmic 
variance): (1) All known prominent LSS in the CDF-S 
has z < 1 (i.e., zlss = 0.67 and 0.73; see, e.g., Silverman 
et al. 2010, and their Fig. 11 where small enhancements 
at other redshifts can also be seen); and (2) The broad 
rcdshift bins (Az = 1) that we considered in Fig. [S^i and 
the broad redshift range (1 < z < 3) where we find most 
of the signal should be, by design, relatively insensitive 
to the effects of cosmic variance induced by LSS. 

As described in § [3l three additional samples of highly 
obscured AGNs or AGN candidates are also shown in the 
effective color-mass diagram (Fig. [5^,), which are seen to 
be massive and relatively red. This motivates and sup- 
ports our utilization of the mass and color constraints as 
clues in identifying a source population (i.e., Sample D) 
responsible for the unresolved 6-8 keV XRB H Neverthe- 
less, the hosts of these highly obscured AGNs or AGN 
candidates are much more massive and redder than the 
Sample D sources (see Figs. [SMlt)- In particular, the 
stellar masses of the Sample D sources appear notably 
low (most having 2 x 10 s < M*/M < 2 x 10 9 ) with a 

median stellar mass of w 8 x 10 s Mq. However, for a 
typical star-forming galaxy with M* = 8 x 10 8 Mq at 
z = 1.6 (the median redshift of the Sample D sources), 
its stellar mass will grow by a factor of « 4-50 by 
the present day, which places its z — stellar mass 
at s» 0.1-1 times the stellar mass of the Milky Way 
(sw 5 x 10 10 M ; e.g., Hammer et al. 2007). The above 
estimate of mass-growth factor was made based on the 
calculations done by Leitner (2012) and equations 1 and 
21 in Peng et al. (2010) taking into account the effects 
of mergers and merger-induced quenching. This pre- 
dicted stellar-mass growth appears consistent with the 
fact that Lyman-alpha emitters with a typical stellar 
mass of ~ 10 8 -10 9 M Q at z ~ 2-3 are thought to grow 
into galaxies about as massive as the Milky Way by the 
present day (Gawiser et al. 2007; Guaita et al. 2010). 
Such a significant stellar-mass growth would imply a very 
large reservoir of gas present to sustain a large amount of 
star formation since z = 1.6. In addition to supporting 
star formation, this gas at z = 1.6 may also feed the su- 
permassive black hole (SMBH; explaining the common 
accretion likely present in Sample D) and obscure the 
SMBH (explaining the high obscured AGN fraction ap- 
parently seen in Sample D; see § 15.2.21 for details). Most 
of the Sample D sources are brighter than Zsso ~ 27, 



Figure 8. (a) Resolved 6-8 keV XRB fractions for Sample D 
sources in various rcdshift bins (cf. Fig. [6J) . (b) Same as Panel (a), 
but for z-band magnitude bins. 

make the major contributions to the unresolved 6-8 keV 
XRB, thus being more likely to host the highly obscured 
AGNs that escape from even the deepest Chandra obser- 



23 We note that the 6—8 keV signal is weak for the X-ray 
undetected sources with M* > 10 10 Mq (the resolved fraction is 
3.0% ± 2.8% with a = 1.1), which could potentially be the more 
heavily obscured counterparts of the X-ray detected highly ob- 
scured AGNs (see Fig. [fj,) . We speculate that there are simply 
not enough objects to produce a significant signal for such massive 
galaxies. 
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thus having reasonably good photometric coverage (over 
99% of the sources in Sample D have detections in at 
least 9 bands), so their photometric-redshift and stellar- 
mass estimates are of sufficient quality for our study 
(see § 12.11 and § 12. 3|) . The above results therefore imply 
that there are a significant number of highly obscured 
AGNs that are hosted by relatively low-mass galaxies 
(2 x 10 8 < M*/Mq < 2 X 10 9 ) at 1 < z < 30 

Such an AGN population might seem surprising given 
that the majority of the X-ray detected AGNs reside in 
massive galaxies. We thus discuss in § 15.21 constraints 
upon these underlying highly obscured AGNs and their 
parent population. 

5.2. Constraints upon Underlying Highly Obscured 
AGNs and Their Parent Population 

5.2.1. Spectral constraints 

At z = 1.6, which is the median redshift of the Sam- 
ple D sources, moderately Compton-thick obscuring ma- 
terial (JVh > 1.5 x 10 24 cm -2 ) would be required to ab- 
sorb X-rays strongly up to rest-frame ps 16 keV, but then 
permit higher energy emission to penetrate through the 
material. In this regime, performing an absorption cor- 
rection to derive a typical luminosity is difficult and ge- 
ometry dependent (and thus subject to large uncertain- 
ties). However, based on results for local Seyfert galax- 
ies (e.g., Guainazzi et al. 2000) and utilizing the MY- 
TORUS model (Murphy & Yaqoob 2009)0 we would 
expect typical column densities of JVh ~ 4 x 10 24 cm~ 2 . 
Due to the Compton-thick nature of the sources on aver- 
age, and the fact that there are surely many star-forming 
galaxies in Sample D, the stacked X-ray spectrum (with 
a measured r e g = 1.60 ± 0.16 that is derived from the 
stacked band ratio, 0.48 ± 0.08; see Table QJ will be af- 
fected by both Compton-thick AGN emission and star- 
formation emission. The quality of the stacked X-ray 
spectrum does not allow for a proper fit, but an illustra- 
tive fit is sufficiently useful for our purposes here, which 
is shown as the solid curve in Fig. [71 This is the sum of 
three components (each evaluated at z = 1.6): an unab- 
sorbed power-law representing the star-formation com- 
ponent (dotted line; the powerlaw model in XSPECF^I 
with r = 2.0), a Compton-reflection component from 
the AGN (dashed curve), and a transmission component 
from the AGN (dashed-dot curve). The latter two AGN 
components (reflection and transmission) were obtained 
with a MYTORUS model with JV H = 4 x 10 24 cm" 2 
(see Footnote [33] for the values of other parameters). It 
is clear that (1) the reflection component dominates the 
6-8 keV emission (cf. a composite, reflection-dominated 

24 The wording of "relatively low-mass" here means that the 
Sample D sources have low masses when compared to the afore- 
mentioned highly obscured AGNs or AGN candidates. They do, of 
course, still have high masses when compared to the large number 
of non-Sample D sources (see Figs. [5^, and [5b) . 

25 We used a model that consists of the transmitted continuum, 
the scattered (i.e., reflection) continuum, and no emission lines. 
In the model we adopted Y = 1.8, z = 1.6, an inclination angle of 
90° , and a varying Afjj (other parameters were fixed to their default 
values). We found that a column density of Wh ;s 4 x 10 24 cm -2 
leads to a ratio of ss 5 between the observed 6-8 and 4—6 keV 
energy output (represented by the EFg values). 

26 XSPEC is an X-ray spectral fitting package (Arnaud 1996) 
that is available at |http: / /heasarc. nasa.gov/xanadu/xspec/ 1 



spectrum for a sample of highly obscured AGNs at z ps 2 
obtained by Alexander et al. 2011; see their Fig. 5), being 
a factor of ~ 4 larger than the transmission component; 
and (2) the unabsorbed power-law component dominates 
the 0.5-4 keV emission. 

5.2.2. Constraints upon AGN fraction 

A quantity of interest is the AGN fraction (/agn) in 
a parent sample of galaxies. While the determination of 
/agn is challenging, there have been some previous at- 
tempts for, e.g., samples of galaxies that include X-ray- 
detected AGNs. For example, X10 estimated /agn ~ 
10% for moderate-luminosity (£o.5-8kcV ~ lO 419-437 
erg s _1 ) AGNs at z ps 0-3 in a parent sample of galax- 
ies with M* > 10 10 - 3 M Q . Recently Aird et al. (2012) 
studied a sample of 2-10 keV selected AGNs and their 
parent sample of galaxies that have 0.2 < z < 1.0 and 
3 x 10 9 < M t /M Q < 10 12 . They found that the inci- 
dence of AGN can be defined by a universal Eddington- 
ratio distribution that is independent of the host-galaxy 
stellar mass and has a power-law form with the slope 
being —0.65 and the normalization evolving strongly 
with redshift [ex (1 + z) 3 8 ]. Their results, if applica- 
ble down to lower mass galaxies and for AGNs up to 
higher redshifts, would yield an estimate of /agn ~ 10% 
for AGNs of L2-iokcV ~ lo 41-44 erg s _1 in galaxies with 
M* ps 10 9 M at z ps 1-3. 

The above estimates of /agn ~ 10% appear to satisfy 
the 6-8 keV non-detection requirement of individual hid- 
den AGNs in Sample D. The 6845 Sample D sources have 
a total stacked 6-8 keV flux of 3.5 x 10~ 14 erg cm~ 2 s _1 . 
Assuming this total flux is produced uniformly from a 
fraction /agn of the Sample D sources (corresponding to 
an observed sky density of ps 2.5 x 10 5 /agn deg -2 given 
a stacking area of 0.027 deg 2 ; see §H]), we obtain an aver- 
age 6-8 keV flux, 5.1 x 10~ 18 /^ G n er S cm " 2 s ~\ f° r tne 
hidden AGNs in Sample D. The on- axis 6-8 keV sensitiv- 
ity limit in the 4 Ms CDF-S is ps 2 x 10~ 16 erg cm" 2 s -1 . 
If it is assumed that these hidden sources are just below 
the 6-8 keV detection threshold, then the non-detection 
in this band requires 5.1 x 10~ 18 ./Xgn < 2 x 10~ 16 , i.e., 
/agn > 2.6%, indicating > 170 AGNs in Sample D. An- 
other estimate of a lower limit on /agn can be obtained 
through population-synthesis models. For instance, the 
Gilli et al. (2007) model predicts that there are ps 150 
obscured (i.e., JV H > 10 22 cm" 2 ) AGNs with 0.5-2 keV 
rest-frame intrinsic luminosities greater than 10 42 erg s _1 
not detected in the central 6'-radius area of the 4 Ms 
CDF-S, and that ps 30-50% of these missing AGNs are 
highly obscured (i.e., JVh > 3 x 10 23 cm' 2 ). This pre- 
dicted fraction of obscured AGNs, ps 150/6845 ps 2.2% 
for Sample D, is likely a lower limit since the population- 
synthesis model of Gilli et al. (2007) does not take 
into account low-luminosity AGNs (i.e., Lo.5-2kcV < 
10 42 erg s _1 ) that tend to be hosted by low-mass galax- 
ies. Therefore, the number of missing AGNs in Sample D 
could potentially be up to several hundred. 

In addition to /agn , another quantity of interest is the 
obscured AGN fraction. As discussed in § 15.2.11 the ma- 
jority of the underlying AGNs in Sample D need to be 
highly obscured in order to produce the steep 6-8 keV 
rise in the stacked Sample D spectrum; this result sug- 
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Figure 9. Same as Fig. [5^., but including the unobscured (i.e., 
having r c ff > 1) and obscured (i.e., having r o fj- < 1) AGNs in 
the central 6' area of the 4 Ms CDF-S. A small fraction of the 
AGNs are luminous (i.e., £o.5-8kcV > 10 43 7 erg s — 1 ), so the 
color and stellar-mass estimates of their hosts are subject to AGN 
contamination; however, this does not affect our discussion here 
(see text). 

gests that the obscured AGN fraction should be close to 
/agn for Sample D. We show in Fig.[9]the effective color- 
mass diagram for the unobscured (i.e., having r e g > 1) 
and obscured (i.e., having T c ff < 1) AGNs detected in the 
central 6' area of the 4 Ms CDF-S. It seems clear from 
the figure that the fraction of X-ray detected sources de- 
creases toward lower masses regardless of whether the 
X-ray sources are obscured or unobscured. In the stellar- 
mass range (2 x 10 s < M+/Mq < 2 x 10 9 ) where most of 
the stacked 6-8 keV signal lies, there are only about 10 
X-ray detected unobscured AGNs (and only 3 of them 
have 1 < z < 3). An order-of- magnitude estimate of 
the percentage of highly obscured AGNs among the un- 
derlying AGN population in Sample D would then be 
1 - [3/(6845 x /agn)] > 90%. This percentage of highly 
obscured AGNs appears high when compared to avail- 
able attempts at measurement of this quantity as a func- 
tion of X-ray luminosity and redshift (e.g., Treister & 
Urry 2 006; Hasinger et al. 2008; Gilli et al. 2010); see 
§ I5.2.3l for estimation of the typical X-ray luminosities of 
our sources. However, the available attempted measure- 
ments have significant systematic uncertainties owing to 
selection incompleteness, limited source spectral charac- 
terization, and other issues. Furthermore, as is clear 
from Fig. [5j we are investigating distant AGN activity 
in a quite different regime from that where the luminos- 
ity and redshift dependences of the obscured percentage 
have been studied, so past results may not be applicable. 

5.2.3. Constraints upon X-ray luminosity, black-hole mass, 
and host stellar mass 

As estimated earlier, we obtain an average 6-8 keV 
flux of 5.1 x 10- 18 /agn 

erg cm s for the hidden 
AGNs in Sample D. Adopting a reasonable absorption 
correction for a column density of = 4 x 10 24 cm~ 2 



within the MYTORUS model (see Footnote for the 
adopted model parameters), we estimate the average 
2-10 keV rest-frame intrinsic luminosity to be 4.0 x 
i0 42 /agn erg s _1 (assuming z — 1.6). Given the cor- 
responding average 2-10 keV rest-frame absorbed lumi- 
nosity 2.0 x 10 4 Vagn er g s_1 > the ratio between the ab- 
sorbed and intrinsic 2-10 keV luminosities is therefore 
(2.0 x 10 41 )/(4.0x 10 42 ) « 5.0%. This ratio for Compton- 
thick AGNs is strongly dependent upon the precise mea- 
surement of absorption, which is difficult and thus ren- 
ders this ratio uncertain (see, e.g., Comastri 2004 for a 
review) . 

Taking /agn = 10% and assuming a 2-10 keV bolo- 
metric correction of 10 (e.g., Vasudevan et al. 2009, 
2010; Lusso et al. 2011), we estimate the average bolo- 
metric luminosity of the highly obscured AGNs hidden 
in Sample D to be 4.0 x 10 erg s _1 , which implies that 
the masses of the relevant SMBH are > 3.1 x 10 6 M Q 
if they accrete at a sub-Eddington level. We assume 
M* ~ Afbuige for simplicity and obtain a correlation of 
M-bk/M* ~ 1/500 in the local universe based on the 
results of Marconi & Hunt (2003). There are studies 
indicating that the average SMBH to host-galaxy mass 
ratio evolves positively with redshift (e.g., Woo et al. 
2008; Merloni et al. 2010); however, such studies are 
subject to large uncertainties (e.g., Jahnke et al. 2009; 
Shen & Kelly 2010). Here we take an evolution form of 
Mbh/M* oc (1 + z) - 68 (Merloni et al. 2010) and obtain a 
correlation of Mbh/M* ~ (1/500) x (1 + 1.6) 68 w 1/250 
at z — 1.6, which then implies a typical stellar mass of 
M* > 7.8 x 10 8 M Q for the hosts of the highly obscured 
AGNs hidden in Sample D. This M* > 7.8 x 10 8 M Q con- 
straint is just consistent with the median stellar mass of 
Sample D (8.1 x 10 8 M Q ; see Tabled]). However, there will 
be a mis-match between the estimated SMBH mass and 
the typical host stellar mass, if a lower /agn value or a 
lower ratio between the absorbed and intrinsic 2-10 keV 
luminosities is assumed. 

5.2.4. Constraints upon star-forming galaxies 

As shown in Fig. [71 the 0.5-2 keV stacked X-ray 
emission of Sample D appears to be dominated by an 
unabsorbed power-law component that is likely associ- 
ated with star-formation related processes. The Sam- 
ple D sources have a total 0.5-2 keV flux of 9.8 x 10~ 15 
erg cm~ 2 s -1 ; the corresponding average 0.5-2 keV flux, 
1.43 x 10~ 18 erg cm~ 2 s _1 , is a factor of ps 6 be- 
low the on-axis 0.5-2 keV sensitivity limit in the 4 Ms 
CDF-S (XI 1). We estimated absorption-corrected fac- 
tors in the 0.5-2, 0.5-8, and 2-10 keV bands using 
a zpowerlwxzwabsxwabs model in XSPEC, where 
z = 1.6, T = 2.0, and intrinsic Ah = 10 20 cm -2 . We then 
estimated the average 0.5-2, 0.5-8, and 2-10 keV rest- 
frame intrinsic luminosities to be 2.6 x 10 40 , 5.2 x 10 40 , 
and 3.0 x 10 40 erg s _1 , respectively. Using these luminosi- 
ties, we obtained SFR estimates of 5.7 M Q yr _1 based 
on the Ranalli et al. (2003) relation between SFR and 
Lo.5-2 kov (see their Eq. 14), 21.8 M Q yr _1 based on the 
Lehmer et al. (2010) relation between SFR and L2-10 keV 
(see the fourth line in their Table 4), 19.9 Mq yr _1 based 
on the Mineo et al. (2012) relation between SFR and 
£0.5-8 keV (see their Eq. 22), and 4.2 M yr _1 based 
on the Vattakunnel et al. (2012) relation between SFR 
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and L2-X0 keV (see their Eq. 6), respectively, assuming 
that the mentioned relations also apply at lower stel- 
lar masses and higher redshifts. Based on the SFRs de- 
rived above and the mean stellar mass of the Sample D 
sources (3.2 x 10 9 M Q ), we estimate specific SFRs (sSFR, 
i.e., SFR per stellar mass) ranging from 1.3 Gyr~ to 
6.8 Gyr -1 . These estimated sSFRs are on the same or- 
der of magnitude as an estimate of 2.0 Gyr -1 made using 
Eq. 1 of Peng et al. (2010) that was obtained based on 
observations of typical star-forming galaxies at z « 0-2, 
with the input values of 3.2 x 10 9 M© and z = 1.6. We 
note that the above relations between SFR and X-ray lu- 
minosities are subject to large uncertainties, with typical 
scatters of 0.4-0.5 dex. 

5.3. Supporting Evidence for Relatively Low-Mass 
Galaxies Hosting Highly Obscured AGNs 

Our finding, that there is an appreciable fraction of 
relatively low-mass galaxies that host highly obscured 
AGNs at z « 1-3, is somewhat unexpected. Neverthe- 
less, there is already some supporting evidence, i.e., there 
are potential analogs both in the distant and nearby uni- 
verse. For example, Trump et al. (2011) identified appar- 
ent weak and/or obscured AGN activity in a sample of 28 
X-ray undetected, low-mass (M* iIne dian ~ 3 x 10 9 Mq), 
z s» 2 emission-line galaxies in the GOODS-S region PI 
suggesting that AGNs may be common in relatively low- 
mass star-forming galaxies at z ~ 2. Further near- 
infrared spectroscopic observations are needed to iden- 
tify larger samples of highly obscured AGNs in relatively 
low-mass galaxies at high redshifts. However, as demon- 
strated by Goulding & Alexander (2009) and Goulding 
et al. (2010), even in the nearby universe significant 
mass accretion onto SMBHs could be missed in the most 
sensitive optical surveys due to absent or weak optical 
AGN signatures caused by extinction. Locally, a recent 
study revealed a CTwmdra-detected, moderately obscured 
(Ah ps 6 x 10 22 cm" 2 ) AGN that may have M BH ~ 
2 x 10 6 Mq residing in a dwarf galaxy (Henize 2-10) with 
M* « 3.7 x 10 9 Mq (Reines et al. 2011). 

5.4. Future Prospects 

As discussed earlier, there are likely at least several 
hundred highly obscured AGNs hidden in Sample D. If 
we could better isolate this population of missing AGNs, 
we would be able to boost significantly the 6-8 keV 
stacked signal. One possibility for achieving a better 
stacked signal would be to obtain improved photometry 
that extends to the key rest-frame if -band or beyond. 
Such improved photometry will lead to more reliable 
stellar-mass estimates, which will consequently result in 
a more efficient sample selection and a likely boost in the 
stacked signal. The Mid-InfraRed Instrument (MIRI; 



Aird, J. et al. 2012, ApJ, 746, 90 

27 Of the Trump et al. (2011) sample of 28 galaxies, 26 (14) are 
included in our Sample A (Sample D) and the other 2 are not in- 
cluded due to their vicinity to the 4 Ms CDF-S sources. Of these 26 
(14) common Sample A (Sample D) sources, our estimates of z p hot 
and stellar mass for 18 (11) sources are in reasonable agreement 
with the Trump et al. (2011) estimates. 



Swinyard et al. 2004; Wright et al. 2004) onboard the 
James Webb Space Telescope (JWST; Gardner et al. 
2006) will be able to provide near- and mid-IR data that 
are greatly superior to the Spitzer IRAC data currently 
in use. The limiting magnitudes of MIRI (for the same 
length of exposure and the same signal-to-noise ratio) go 
deeper by over 2 magnitudes than those of IRAC. This 
means that all of our Sample A sources (compared to 
«60% currently) will have rest-frame A-band coverage 
or beyond with photometric quality significantly better 
than that at present. Other possibilities for increasing 
the stacked signal by, for instance, identifying AGN can- 
didates through morphologies or through deep optical 
and near-infrared spectroscopic observations (where the 
CANDELS imaging and JWST spectroscopic data would 
be most critical), remain of interest, but are beyond the 
scope of this work. 

One might think that future hard X-ray missions such 
as NuSTAR and ASTRO-H would be able to detect 
such highly obscured AGNs hidden in relatively low-mass 
galaxies. In the distant universe (z > 0.5), however, 
NuSTAR and ASTRO-H simply do not have sufficient 
sensitivity to make such direct detections (e.g., Luo et al. 
2011). In contrast, a 10 Ms CDF-S has the potential of 
detecting a fraction of such highly obscured AGNs if at 
least some of these hidden sources are not too far below 
the 6-8 keV detection threshold of the 4 Ms CDF-S. Fur- 
thermore, a 10 Ms CDF-S would also increase the signal- 
to-noise ratios of the stacking results and thus comple- 
ment the aforementioned approaches of improved sample 
selection. 
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